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DIGITAL SAMPLING INSTRUMENT EMPLOYING CACHE MEMORY 

By: David P. Rossum 

CROSS-REFERENCE TO R ELATED APPLICATIONS 

The present application is a continuation-in-part of 
U. S. co-pending application serial number 07/462,392, filed 
January 5, 1990, entitled Digital Sampling Instrument for Digital 
Audio Data, and which will issue as U.S. Patent No. 5,111,727 on 
May 12, 1992. 



BACKGROUND OF THE INVENTION 

The present invention may be efficiently implemented in 



3 

:i 
m 

Q 

UlO a single VLSI integrated circuit of low cost. The present 

m 

Sj invention provides for a very high channel count, limited only be 

the speed and cost of the circuit and the average degree of 
upward pitch shifting required. Also, the present invention 
allows for use of multiple interpolator circuits to be used with 

o 

ffj.5 a single waveform memory. 

The present invention relates to electronic musical 
instruments, and more particularly to digital sampling 
instruments which create musical notes by reproducing recorded 
waveforms of musical instruments or sound effects, or 
2 0 mathematically calculated waveforms from a waveform memory at a 
variable playback rate. As has been previously disclosed in the 
above-identified cross-referenced patent application Serial No. 
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07/462,392 filed January 5, 1990 one technique for improving the 
performance of such instruments is the use of a cache memory and 
waveform interpolation. Such a technique increases the available 
channel count of the instrument by eliminating the waveform 
memory access time bottleneck which limits performance. While 
the basic use of cache memory has been previously described, 
there are several improvements beyond the preferred embodiment 
described in patent application Serial No. 07/462,392 which are 
described herein. 

summary of the Invention 

It is an object of the present invention to provide improved 
N point interpolation variable pitch playback of musical sounds. 
The techniques so described can be efficiently implemented in the 
preferred embodiment by a single VLSI circuit of low cost. The 
preferred embodiment allows a very high channel count which will 
support many simultaneous musical notes. It also allows memory 
systems having variable access times, as well as the use of more 
than one interpolator circuit or chip with a single sound 
waveform memory. 

Further described are techniques to optimize the 
performance of the system by decreasing the complexity of the 
VLSI circuit required to implement cache memory. Another 
technique described improves the cache system by increasing the 
degree to which upward pitch shifting on a minority of the 
channels can occur. 
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Other objects, features and advantages of the present 
invention will become apparent from the following detailed 
description when taken in conjunction with the accompanying 
drawings . 

5 

Brief Descri ption of the Drawings 

The accompanying drawings, which are incorporated in and 
form a part of this specification, illustrate embodiments of the 
invention and, together with the description, serve to explain 
MlO the principles of the invention: 

0 

q Figure 1 depicts a block diagram of an interpolator. 

f- 

Figure 2 depicts a block diagram of an interpolator 
i with cache memory. 

M Figure 3 depicts a block diagram of the present 

13 15 invention. 

Figure 4 depicts details of the address update unit of 

jj Figure 3. 

PJ Figure 5 depicts details of the address register file 

of Figure 3. 

20 Figure 6 depicts details of the priority unit of Figure 

3. 

Figure 7 depicts details of the memory access unit of 

Figure 3 . 

Figure 8 depicts details of the cache memory unit of 

25 Figure 3. 
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Figure 9 depicts details of the convolution unit of 

Figure 3 . 

Figure 10 depicts a block diagram of a shared memory 

system. 

Figure 11 depicts a block diagram of the use of 
multiple waveform memory types. 

Figure 12 depicts a reservation table for cache RAM. 

Detailed Description of t he Preferred Embodiment 

Reference will now be made in detail to the preferred 
embodiments of the invention, examples of which are illustrated 
in the accompanying drawings. While the invention will be 
described in conjunction with the preferred embodiments, it will 
be understood that they are not intended to limit the invention 
pl5 to those embodiments. On the contrary, the invention is intended 
to cover alternatives, modifications and equivalents, which may 
be included within the spirit and scope of the invention as 
defined by the appended claims. 

The fundamental architecture of a sampling digital 
instrument which supports L channel pitch shifting by 
interpolation, as described in patent application Serial No. 
07/462,392 is shown in figure 1 and includes a waveform memory 1, 
an address update unit 2 supplying an integer part 3 and a 
fractional part 4 of a current address for each of L channels, a 
memory address generator 5 producing waveform memory addresses 6 
from the integer part 3 of the current address and a convolution 
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unit 7 producing output samples depending on the fractional 
address 4 and the data 8 from waveform memory 1. For each output 
sample and each of the L channels, the address update arithmetic 
circuit 10 provides a new current memory address consisting of an 
5 integer 3 and fractional 4 part which is created from the 

previous current address stored in register file 9 by adding a 
phase increment which is also stored in register file 9, and the 
convolution unit's multiply-accumulator circuit 12 computes a sum 
of products of samples located in the waveform memory times 
NlO coefficients (which depend on the fractional part of the memory 
address) . This can be represented by the formulas: 

I 

0 A n = A n . x + P 

'i\ Yi+f - X l£ *C 0 (f) +X,a I 3C 1 (f)+. • .+X i Cn-l(f)+. . .+X,.n-iC n (f) 

2 2- 2 2 

where An is the new current address, An-1 is the previous current 
address, P is the phase increment, Yi+f is the output signal 
f!jL5 representing the current address with integer part i and 

fractional part f , Xm represents the waveform memory sample at 
address m, and Cn(f) represents the nth coefficient which is a 
function of f. 

The coefficients Cn(f) are computed in the coefficient 
20 generator 11 either algorithmically from the fractional address 

f, by a table lookup based on f, or by a combination of these two 
approaches, such as linear interpolation of a limited size table. 
The output signals Y are digital output data 13 ready to be 
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further processed by additional circuitry or by time domain 
multiplexing of some of the existing blocks. 

The current memory address A for any given channel is 
continually being increased by the phase increment P at each 
5 output sample period. The magnitude of the phase increment 
determines the pitch shift from the original pitch, with an 
increment of unity being no shift, and increments smaller than 
unity shifting the pitch downward. As described in patent 
application Serial No. 07/462,392, the phase increment is most 

Mio commonly less than one. Consequently, the number of accesses of 

fl waveform memory can be reduced by the use of a cache memory. 

Sj This will in turn increase the number of channels which can be 

|[ supported by a single waveform memory. 

SJ A cache memory system is shown in figure 2. In this 

pis system, not only does address update unit 2 produce an integer 3 

ft 

fl! and fractional 4 part of a new current address, it also produces 



a cache data required size (S) 14, defined as: 



0 

W S - lnt(A n )-Int(A n . 1 ) 



where Int(x) is the integer part of x. 

The cache memory 15 contains M entries for each 
channel, and M must be an even power of two. A minimum of N 
entries is required for N point interpolation, that is M £ N. 
Thus, for example, linear interpolation (N=2) requires 2 or more 
entries per channel, and eight point interpolation M is greater 
than or equal to 8. 
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Cache memory 15 is written with S samples from waveform 
memory 1, the last of which is located at the integer part 3 of 
the current memory address. The waveform memory addresses to 
accomplish this are generated by memory access unit 16, which can 
5 operate independently and asynchronously from the address update 
and convolution units. Memory access unit 16 also generates the 
cache write address 17 at which the samples are stored. 

Ultimately, the required samples 18 are read from the 
cache memory at sequential addresses, the last of which is 
yio located at the truncated least significant portion of the integer 

part 3 of the current memory address, and supplied to the 
JJ; convolution unit for multiplication and accumulator with 

HI coefficients derived from the fractional part 4 of the current 

S| address. This system operates according to the algorithm 

qL5 previously described in patent application Serial No. 07/462,392. 

The method described in patent application Serial No. 
N= 07/462,392 has some limitations. Specifically, a dual port cache 

fy memory with separate write and read addresses .is required, and 

the time required to complete all waveform memory accesses in the 
20 worst case is limited, thus placing an unnecessary limit on the 
upward pitch shifting capability of the circuit. The present 
invention eliminates both these limitations, and is shown in 
block diagram form in figure 3. 

As in all interpolation systems, an address update unit 
25 2 produces a new current address for each output sample for each 
channel. However, in the present invention, the current address 
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for each channel is stored in an external address register file 
21, along with the cache size and cache base address for each 
channel. Address register file 21 is only used by address update 
unit 2 for two cycles (one to fetch the old current address and 
another to store the new current address) for each channel, so it 
is available for other accesses which are controlled by memory 
access unit 23 on the remaining cycles. In the preferred 
embodiment, alternate cycles are available to the memory access 
unit, as determined by system state counter 26. 

Address update unit 2 produces a cache request signal 
29 for a given channel if the integer part of the new current 
address differs from the integer part of the old current address. 
This request indicates that a waveform memory access is required. 
It also produces a cache size (S) and cache base address (CBA) . 
The cache size is defined as above; the cache base address is a 
cumulating sum of the cache size, or: 

CBA„ = (Int(AJ - IntfA,,.,) + CBA^j) mod M 
where M is the cache memory size in entries per channel. 

The need for a cache base address is twofold. First, 
the size of the cache is not an even power of two, the mechanism 
described previously in patent application Serial No. 07/462,392 
will not function because the mod operation in the equation above 
defining the CBA cannot be implicitly performed by truncation 
leaving only the least significant bits of the integer part of 
the current address. Thus a separate cache base address which 
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can be modulo M must be maintained. However, even when H is an 
even power of two, there is reason to maintain a separate cache 
base address. A typical feature of digital sampling musical 
instruments is the ability to loop a sound in waveform memory. 
5 This is typically accomplished by an algorithm implemented in the 
address update unit in which the new current address is compared 
against a loop end address. If the end address has been 
exceeded, the size of the loop is subtracted to begin the loop of 
sound again near the start. This technique is well known to 

,10 those skilled in the art. Because the size of the loop is not 

h 

g necessarily an integer multiple of the cache size, the cache 

address cannot be identical with the least significant bits of 
yl the current address if a loop is implemented. Instead, a 

%i separate cache base address must be maintained. When a loop 

j^L5 occurs, the current address is updated according to a 
M<j conventional looping algorithm, but the cache base address is 

M= simply increased modulo M by the phase increment as with an 

fy unlooped address update. 

In Figure 3, priority unit 22 determines from the cache 
20 requests of all channels which channel's request should be 

honored first. It then supplies that priority channel's number 
27 to memory access unit 23 and to address register file 21. An 
idle signal 24 is also supplied to indicate that no requests are 
pending, and an accept signal 25 causes the priority unit to 
25 reset a channel's cache request as having been accepted for 
service. 
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Memory Access Unit 23 responds to a request for a given 
channel's service by asserting accept signal 25 after acquiring 
the channel number from priority unit 22, and the current 
address, cache size, and cache base address for that channel from 
the address register file 21. For each waveform memory location, 
beginning at the current address and decrementing until S 
locations have been accessed, waveform memory 1 is accessed and 
the sample located at the indicated address is placed into cache 
memory 30. The first fetched sample is placed at the cache base 
address for the channel, and subsequent samples, if any, are 
placed at successively decreasing locations module M. 

The Convolution unit 7 behaves similarly to typical 
interpolation systems, responding to the fractional part 4 of the 
old current address to determine a set of N coefficients for N 
point interpolation. The old cache base address 28 is used to 
generate the starting location of the sample points in the cache 
memory, from which the sum of products of samples and 
coefficients is generated. 

Master state counter 26 provides information used by 
all units as to the channel number under service and the 
processing state for that channel. 

Now that the overall structure of the present invention 
has been explained, the details of each of the elements will be 
explained in detail. 

The address update unit for the preferred embodiment 
shown in figure 4 is similar to that required for most 
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interpolators, and should be familiar to those skilled in the 
art. Constants for control of the current address are stored in 
register file memory 51, which is accessed according to address 
and read/write signals 60 and 61 supplied by control logic block 
5 52, which derives the register file access pattern from the 

master state counter. For cycles in which a register file access 
is not required, initial data can be written into the register 
file as requested by a controlling microprocessor. Constants 
accessed from register file 51 are loaded into register 53 for 
L 10 manipulation by the address update ALU 57. 

For example, the phase increment P is loaded into 
register 53 during a particular cycle. Simultaneously, data from 

s 

01 the address register file is loaded into register 54. 

%4 Multiplexers 55 and 56 select appropriate fields of the data thus 

Pp.5 present in the registers to be manipulated by ALU 57 to produce 
LH the new current address and cache base address which are clocked 

H into register 58. Multiple cycles are typically required in 

O 

fU order to provide for sound looping by comparison of the results 

of the incremented current address to a loop end address and 

20 conditional subtraction of a loop size. At some value of the 

state counter, the completely valid new current address and cache 
base address are available in register 58 and can be re-written 
into the address register file along with the cache size produced 
by cache request/ size logic block 59. This is repeated for each 

25 of the L channels to be processed according to the most 
significant portion of the state counter. 
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Other outputs from the address update unit are 
fractional part of the old current address and the old CBA. 
These are available from register 54 which has latched this data 
when accessed from the address register file prior to update. 
Another output is the cache request signal, which is derived 
from phase increment, old current address, old cache size, and 
any previously unhonored request from this channel, by cache 
request/size logic block 59. 

The cache size for the preferred embodiment is slightly 
more complex than the equation for S above due to two 
complicating factors. First, in the preferred embodiment, there 
are two levels of priority for cache requests, low and high, as 
explained below. It is possible for a low priority request to 
remain unhonored for an entire sample period. In this case, the 
unhonored request must continue to persist, and be added to the 
cache size as an additional cache entry to be fetched. 

Furthermore, for reasons of channel startup, it is 
necessary that if the previous cache size was set to zero by the 
controlling processor, that any pending low priority request be 
ignored. Thus the logic for cache size follows the equation: 

S = IntCAJ - Int(A n .,) + (OldS! = 0) * OldP 
where OldS is the old cache size, and OldP is the state of the 
low priority bit for this channel from the previous sample 
period. Note that S must be saturated to never exceed M, the 
number of cache entries per channel. 
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A low priority cache request is generated if S is equal 
to one. A high priority cache request is generated if S is 
greater than one. 

One skilled in the art will note that there is 
5 redundancy in the definitions of S and the low priority and high 
priority signals. In the particular case of M (the number of 
cache entries per channel) being an even power of two, and N (the 
number of interpolation points) being M-l, the required values of 
S are zero to an even power of two, which must be very 
Hlo inefficiently encoded in log2M+l bits. Encoding S=0 indicating 
0 zero cache entries required, S=l to indicate either 1 or 2 

X cache entries required depending on the priority (low=l, high=2) 

J?! and S>1 indicates S+l entries required reduces the bit 

N requirement to an efficient log2M bits. 

Cl5 The address register file shown in figure 5 is once 

ry again a block that should be understood by those skilled in the 

^ art. Register file memory 61 contains a location for each 

=-y channel, in which is stored the current address, cache size, and 

cache base address. On alternate cycles as based on the master 
20 state counter, access to the memory is given to the memory access 
unit by reading the address indicated by the channel number 
provided by the priority logic. The remaining cycles can be 
allocated to reading and subsequently writing the data for the 
channel under service by the address update unit, whose channel 
25 number is determined from the most significant part of the state 
counter, or to writing initialization data into the register file 
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memory from a controlling processor. The selection of the write 
data from controlling processor or address update unit is 
performed by multiplexer 63, which along with register file 
address and read/write is controlled by control logic block 62 
5 which derives its sequence from the master state counter. 

The priority unit is shown in detail in figure 6. 
Before the function of the priority unit can be explained, some 
further clarification of the function of this circuit is 
necessary. 

3.0 While the minimum size for a cache memory is M=N 

entries per channel for an N point interpolator, if the cache 

CI memory is made larger, for example N+l locations per channel, a 

pj significant benefit can be achieved. 

Consider the typical operation of the cache memory and 
address update unit. The address update unit will determine the 

rU current address (both integer and fractional part) and initiate 

Q the convolution cycle for the corresponding output point. Note 

that this is why the old fractional part and CBA are used by the 
convolution unit and cache memory control circuit. The address 
20 update unit will then add the phase increment to the current 
address to produce a new address. If the new address has 
changed in integer part from the old address (S != 0), the memory 
access unit is requested to fetch the data from the new address 
and any intervening addresses into the cache. This must be done 
25 before the convolution cycle for the next output point, if only N 
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locations are present in the cache, because all N must be valid 
for an N point interpolator. 

Since each channel is operating with a phase increment 
of arbitrary fractional part, the number of waveform memory 
cycles will jitter between two values for each channel. For 
example, if the phase increment were exactly 0.5, alternate 
cycles would require zero or one waveform memory access. The 
typical case would involve a much more complex pattern of 
accesses. However, it is clear that there is a "worst case" 
instance in which every channel would have its larger number of 
waveform memory accesses required. 

If the memory access unit were not capable of servicing 
all of these accesses, the cache would not be properly filled for 
some channel and an incorrect convolution would result. Thus the 
maximum upward pitch shift capability for such a unit with N 
locations per channel for an N point interpolator, L channels, 
and M waveform memory accesses per sample period, will be such 
that the sum of the integer parts of the phase increments for all 
the channels is less than M-L. Thus, for example, if M equals 64 
and L equals 64, then no pitch shift can exceed unity. 

The above situation can be alleviated by the use of an 
oversized cache memory and a priority circuit. If the cache 
memory has one more location than the number of points of 
interpolation, then one cache memory location can remain invalid 
for more than one cycle. Depending on the phase increment for 
that particular channel, the waveform memory access for that 
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channel can be postponed for one or more cycles while accesses 
which are required more quickly are performed. It can be shown 
that in this case, the sum of the phase increments themselves 
cannot exceed M. This is a considerably better situation than 
that above. 

The priority circuit must thus distinguish between two 
types of memory access unit requests, those which can be serviced 
later, and those which must be serviced before the next cycle. 
Within each category, a priority is established in which as a 
channel gets closer to its next service by the address update 
unit, it gets higher priority. Such a circuit is shown in figure 
6. 

There are two cache service request lines, one of low 
and one of high priority. The generation of these by the address 
logic has been explained above. A shift register 70 is enabled 
once for each channel, and thus maintains a bit for each channel 
for each of these two priority levels. The 2N outputs of the 
shift registers are routed into a standard priority encoder 71, 
whose output 72 is the number of the highest active input of high 
priority, if any, or if none, then the number of the highest 
active input of low priority. If no inputs are active, an idle 
flag 73 is asserted. The priority channel and idle flag are 
latched in register 74. Because the requests are being shifted 
within the shift register for each new channel processed, the 
number in register 74 must be added to the current active channel 
number 75 as provided by the master state counter to give the 
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current priority channel. This sum at the output of adder 76 is 
stored in enabled register 77, and supplied to the address file 
as the memory access unit channel. 

When enabled register 77 becomes enabled with non-idle 
data, a new priority channel has been accepted for service, and 
that channel's priority can thus be reset. This is determined by 
gate 78 whose output becomes active only if the output of 
register 74 does not correspond to an idle priority, and either 
the output of enable register 77 is idle, or the memory access 
unit is acknowledging acceptance of the channel indicated by 
enabled register 77. In this case, decoder 79 is enabled and 
resets both shift register bits corresponding to the channel 
which is indicated by register 74. Enabled register 77 is 
disabled after it has accepted the new channel because the memory 
access unit will negate its accept line and is re-enabled when 
the memory access unit has completed its processing of the 
previous channel and hence asserts the accept signal again. 

Figure 7 shows the details of the memory access unit. 
When a new priority channel 80 becomes valid from the priority 
unit as indicated by idle signal 81 becoming low, these signals 
along with the current address integer part, cache size, and 
cache base address for the channel as supplied by the address 
register file are each latched into enabled register 82 which is 
enable during cycles when the memory access unit has had an 
access to the address register file as determined by control 
logic 83 from the master state counter. AND gate 84 then accepts 
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the data from enabled register 82, and initiates a set of memory 
access cycles if the data in register 82 is valid (not idle) and 
the memory access unit is not already busy. Control logic 83 
begins a cycle by presetting down counters 85, 86, and 87 with 
the current address integer part, cache base address, and cache 
size respectively, and indicates the memory access unit is busy 
by asserting line 88. If the cache size is zero, the cycle is 
aborted. This is necessary for the startup condition on a 
channel so that the controlling processor, by setting cache size 
to zero, can halt any activity which will fill the cache. 

If the cycle is not aborted, the memory interface logic 
89 determines from the current address what type of memory is 
being used in this portion of the address space, and initiates an 
appropriate memory cycle by providing properly formatted address 
and control signals on waveform memory address and control lines 
90. 

Figure 11 shows a typical connection for multiple 
memory types. Note that some address and control lines can be 
shared by both types of memory (in this case, DRAM and ROM memory 
have been chosen) as illustrated by signals of group 110, while 
others are dedicated to either one or the other memory, such as 
group 111 for the DRAM, and 112 for the ROM. When the waveform 
memory data is present on the memory data bus 8 of Figure 7, it 
is latched along with the channel number and the CBA counter 
value in enabled register 91, and memory interface logic 89 
indicates to control logic 83 that the current memory cycle is 
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complete. Control logic 83 then asserts cache write request line 
92, indicating that the cache write data and cache address 
present in enabled register 91 have become valid, and also causes 
all three counters 85, 86, and 87 to be decremented. If the 
5 cache size in counter 87 has become zero, the request is complete 
and the busy line is brought inactive, enabling another request. 
If the cache size is non-zero, another memory cycle is initiated 
with the decremented current address and CBA. 

Note that dynamic memory can use page mode accesses for 
Q10 multiple memory cycles to adjacent memory locations, and that the 
^ logic to accomplish this is straightforwardly included into the 

memory interface logic. Also note that the asynchronous nature 
ill of the memory interface logic allows for optionally including 

s logic for bus arbitration signals 93 and 94 (which would 

pjl5 typically be a bus acknowledge and bus request respectively) 
ru which would allow sharing of a single waveform memory among 

0 multiple interpolator systems or chips. Naturally in such a 

ru 

case, the waveform memory address and control signals 90 would 
have to be capable of being output disabled by memory interface 

20 logic 89 when it was not the bus master. Such a multiple 
interpolator system is shown in figure 10. 

Details of the cache memory unit are shown in figure 8. 
Note that the silicon area required to implement dual port memory 
is nearly twice that require for single port memory. Hence it is 

25 cost effective to utilize single port memory whenever possible. 

Avoiding a dual port cache memory is accomplished by noting that 
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the convolution algorithm requires the sum of products, each of 
which comes from successive locations in the cache memory. In 
other words, for an N point interpolator, the output will be the 
sum of a first coefficient times a first point in the cache 
memory, plus a second coefficient times the next point in the 
cache memory, et cetera, up to the Nth coefficient times the 
final sequential cache memory datum. Consequently, the cache 
memory can be divided into two separate single port memories as 
shown in figure 8. Even cache memory 31 contains the even 
locations and odd cache memory 32 contains the odd locations. 
Multiplexer 33 routes the output data from the cache memory 
active for reading data to the convolution unit. The convolution 
unit will thus access alternate memories, leaving the unaccessed 
memory free for accepting data from the waveform memory as 
directed by the memory access unit. 

Control logic 34 determines from the read cache base 
address 35 supplied by the address update unit and the master 
state counter which memory will be active for reading during a 
given cycle. 

This determines the state of multiplexer 33 as well as 
that of address multiplexers 36 and 37, which determine if memory 
31 or 32 respectively is getting its address from the memory 
access unit or from the cache base address and channel number 
under service by the convolution unit. 

By allocating at least one cycle of the convolution 
circuit to a function other than the convolution itself, either 
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an idle cycle or one for scaling the volume of the resulting sum 
of products, one cache memory cycle per output point can require 
no read cycle and thus have both cache memories available for 
writing memory access unit data. As shown in the reservation 
5 table in table 1 (see Figure 12) this then guarantees that the 
proper cache memory is available for a write access from the 
memory access unit within no more than two cycles. At typical 
clock rates, this occurs faster than the access times of memories 
suitable for waveform storage. Control logic 34 thus responds to 
10 a cache write request on line 92 by writing the waveform memory 
data at the required cache memory address within two cycles. 

The details of the convolution unit are shown in figure 
9. Coefficient logic 100 derives the N coefficients as a 
function of current address fractional part 4 according to the 
15 master state counter as interpreted by control logic 101. This 
can be accomplished according to various algorithms, some of 
which are described in patent application Serial No. 07/462,392. 
Multiply/ accumulator 102 forms the sum of products required by 
the interpolation algorithm from the coefficients 103 times the 
2 0 cache memory data 104. Control logic 101 can reset the 

cumulating sum in the accumulator using AND gates 105 at the 
beginning of each new channel. The ultimate sum of products is 
then formatted, for example, into a serial data stream for 
further processing, by output formatting logic 106. 
25 Provision is also made, by use of multiplexers 107 and 

108, to use any idle cycles of the multiply/ accumulator (one of 
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which is necessary if single port cache memory has been used as 
described above) for alternate functions. For example, the 
output of the multiply/accumulator's final sum can be routed into 
one input of the multiplier by multiplexer 107, while a volume 
scaling number is routed into the other input by multiplexer 108, 
allowing the idle cycle to provide an individual volume control 
for each channel before the final sum of products is routed to 
output formatting logic 106. 

The foregoing descriptions of specific embodiments of 
the present invention have been presented for purposes of 
illustration and description. They are not intended to be 
exhaustive or to limit the invention to the precise forms 
disclosed, and it should be understood that many modifications 
and variations are possible in light of the above teaching. The 
embodiments were chosen and described in order to best explain 
the principles of the invention and its practical application, to 
thereby enable others skilled in the art to best utilize the 
invention and various embodiments with various modifications as 
are suited to the particular use contemplated. It is intended 
that the scope of the invention be defined by the Claims appended 
hereto and their equivalents. 
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